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Abstract - Users’ electricity use may be shared nearly instantly with utilities companies on the smart grid due 
to Smart Meters (SMs). The data is communicated between individuals in the company, which has prompted 
privacy issues. In this paper, a privacy-preserving method for smart meter data is designed using a swarm 
intelligence algorithm. In the swarm intelligence algorithm, JAYA algorithm is taken under consideration 
and deployed for key generation in the planned method. After that, an XOR operation was performed 
between the smart meter sensitive data and key generated using a JAYA algorithm for encryption purposes. 
In MATLAB, numerous performance metrics are computed for the planned technique's performance 
evaluation. From Simulation results it is clear that the planned method gives better results as compared to the 
previous methods. 
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1. Introduction e Billing by use of precise measurement and 
management. 
Energy consumption balancing 
Communication between stakeholders 
must be two-way. 

e Increasing the ability to withstand assaults 
by malicious users. 

e Increased resiliency due to decentralized 


The term "smart grid" refers to the combination of 
information and communications technologies 
(ICT) with the conventional power system. To be 
considered smart, a smart grid must be capable of 
two-way communication among the various entities 
that make up the grid. Energy production and 
consumption may be monitored and controlled 


more effectively thanks to this method of improved control. 
management. As a whole, the smart grid is meant 7 Effective | use of resources to boost 
productivity 


to accomplish the following goals [1]: 


In spite of the advantages of the smart grid, the 


e Increased reliance on eco-friendly and : ; : 
two-way connection enables the gathering of fine- 


renewable energy sources. 
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grained information about user usage via the smart 
metre. Due to the fact that the information obtained 
may be used to infer user patterns, appliance 
categories, and overall energy use, this poses a 
significant risk of privacy violations. The 
protection of personal information is thus critical in 
a smart grid setting. Smart grid privacy and 
security may be preserved by a number of different 
strategies that have been put out in the literature. 
It's practically impossible to maintain privacy 
effectively without security since the two are so 
closely linked [2]. 


1.1 Privacy Preserving Schemes in Smart Grid 


Privacy is maintained in smart grids via a variety of 
strategies, including data aggregation and 
anonymization. In order to acquire fine-grained 
consumer energy usage data from smart metres, 
data aggregation is a common technique of doing 
so. MPC or homomorphic encryption is used to 
guarantee security and privacy needs are met 
during data aggregation. The MPC-based 
approaches enable several parties to collaboratively 
calculate a value as owned by individuals’ data 
without revealing the substance of data with the 
other involved parties. When it comes to arithmetic 
calculations, | homomorphic _ encryption-based 
approaches enable the same operation to be 
performed on encrypted text as it is on plaintext. 
Entities in the smart grid may do necessary 
calculations without understanding the data 
because of this characteristic. Using pseudonyms, 
hashing values, and other methods to obscure the 
actual identity of users is another common way to 
protect privacy in the smart grid [3]. Real-world 
consumers and their energy usage statistics are 
challenging to connect. As a result, a number of 
techniques are hybrid-based, which means they use 
more than one method to protect privacy. Similar 
approaches include temporal perturbation, Shamir's 
secret exchange, private keys and bit rotating 
methods [4]. 


1.2 Contribution 


The main focus of the work is to design a method 
for smart grid that preserving privacy. This is done 
by using swarm intelligence algorithm. Sensitive 
data and the random key is read and XOR 
operation is performed between them for security 
purposes. The random key is generated using a 
JAYA algorithm based on the objective function. 
Here Entropy is taken as the objective function in 
the key generation method. The simulation analysis 
is done in MATLAB. According to the findings, 
the new approach outperforms the current ones. 
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1.3 Paper Organisation 


The remaining paper is organized is as follows. 
Section 2 defines the related work. Section 3 
illustrates the planned method is designed to 
provide privacy in the smart grid. Section 4 shows 
the simulation results of the planned method. In 
section 5 defines the conclusion and future scope. 


2. Related Work 


In this section, related work is shown to understand 
the existing encryption algorithms are deployed for 
security purposes. 


Chen et al. [5],studied a fine-grained energy 
supply is possible due to the use of smart metres on 
the smart grid, which report power use to servers 
on a regular basis. There are, however, concerns 
about the privacy implications of the data that is 
routinely disclosed. They may, for instance, 
indicate whether or not the owner of the residence 
is at home and whether or not the television is 
operating. People are frightened of revealing this 
type of sensitive information since privacy is 
becoming a major concern. It was found that the 
conventional method continues to suffer from both 
a failure rate of the metres and an issue with their 
replacement in this study, so researchers came up 
with an idea for an improved method of collecting 
data from smart metres that relies on the noise 
addition method as well as an encryption algorithm 
called homomorphic. The experimental findings 
reveal that the calculation costs on both the 
aggregators and smart metre sides have been 
lowered after the simulations. Semantic security is 
shown through a powerful security analysis of the 
planned approach. 


Barbosa et al. [6], There are numerous reasons for 
electricity companies to acquire high-resolution 
data on energy usage from consumers via a Smart 
Metering infrastructure. However, this data 
collecting entails a great deal of detail regarding the 
energy usage of the consumers being tracked. As a 
result, a critical issue must be addressed: how can 
consumers’ privacy be protected while. still 
allowing certain services to be provided? Clearly, 
there is a tradeoff between privacy and benefit in 
this situation. There are a variety of options for 
preserving privacy, but many of them have a 
negative impact on data effectiveness or are 
computationally costly. We suggest and analyse a 
lightweight strategy based on the inclusion of noise 
for privacy and utility in this study. We also discuss 
the impact of the technique in several Smart Grid 
situations using real customer data. Finally, we 
devise and test potential assaults on our solution. 
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Eibl et al. [7],studied that with smart metres, 
consumers are concerned about their privacy 
because of the extensive data collecting. To find a 
compromise between practicality and privacy, data 
from many smart metres may be combined. 
Differential confidentiality in smart metering has 
been presented as a way to ensure the privacy of 
end-users by perturbing aggregates. Due to this, the 
practicality of differential privacy for smart 
metering in the actual world cannot be assured. 
They examine the impact of differential privacy on 
actual smart metering data, with a particular focus 
on the need to strike a balance between utility 
needs and privacy concerns. It turns out that even 
with significant changes to the fundamental 
technique, an aggregation group of thousands of 
smart metres is still required to provide good 
usefulness. 


Firorette et al. [8],Commercial and academic 
research may benefit greatly from high-fidelity 
energy networks. Since a result, these leaks create 
basic security and privacy issues, as they might 
disclose sensitive business information and reveal 
system flaws. For power networks in which 
transmission lines and transformer characteristics 
are obscured, this research examines ways to make 
the data available for analysis. Differential Privacy 
(DP), a framework that gives strong privacy 
assurances and has received a lot of attention 
recently, is used to accomplish this. Complex AC- 
infeasible networks are often the consequence of 
relying only on basic DP techniques. It is for this 
reason that this study proposes an innovative 
approach that ensures AC practicality while also 
preserving the authenticity of the obscured power 
network to some extent. In addition, the results of 
the experiments suggest that the obfuscation 
minimizes the potential harm from an attack carried 
out by exploiting the leaked dataset greatly. 


Singh et al. [9], they have planned a privacy- 
preserving method for smart grid using ANU and 
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noise addition method. In their work, the data is 
split into customer personal information and 
electricity consumption information. ANU 
algorithm is used for encrypt customer personal 
information and noise addition on _ electricity 
information by performing the perturbation 
algorithm. They achieved superior results in terms 
of execution time, avalanche effect, and memory 
consumption. 


2.1 Motivation 


In the literature, noise addition is done in the 
sensitive data to provide privacy. Noise addition is 
done by performing the data perturbation method, 
data encryption. However, generating a random key 
for data encryption is challenge. In this paper, a 
complete random key is generated using swarm 
intelligence algorithm by considering the objective 
function. 


3. Planned Method 


In this section, the planned privacy-preserving 
method is explained that is designed to secure 
smart meter data. Figure 1 shows the detail 
flowchart of the planned method. Initially, the 
smart meter dataset is read. When the dataset is 
pre-processed, sensitive attributes are separated 
from non-sensitive attributes, and the dataset is 
then divided again. The encryption algorithm is 
provided the information about the sensitive 
attributes. The encryption algorithm encrypts the 
sensitive attributes with the private key by 
performing the XOR operation and gives the 
encrypted sensitive data in the output. The private 
key is generated using the swarm intelligence 
algorithm. In the planned method, a JAYA 
algorithm is designed to generate a completely 
random key. After encryption, the encrypted 
sensitive data is concatenated with non-sensitive 
data. In the last, the performance of the planned 
method is analysed. 
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Read the Smart Meter Dataset 


| | 


Sensitive Data Non-Sensitive Data 


Key Generation using Swarm 
Intelligence Algorithm 


XOR 


Encrypted 
Sensitive Data 


| 


Concatenate Data 


| 


Performance Analysis 


Figure 1 Flowchart of the planned privacy-preserving method 


Next, a detailed description of the swarm 
intelligence algorithm is deployed for the planned 
method is explained. 


3.1 JAYA Algorithm 


At its inception, the JAYA method was designed to 
deal with optimization functions that were both 
complicated and unbounded [10]. Because it is 
derived from Sanskrit, the name JAYA literally 
translates as victory. Swarm-based intelligence and 
evolutionary algorithms are combined in this 
method, which is a population-based metaheuristic 
that incorporates the best qualities of both. As a 
result, it is inspired by natural phenomena such as 
the "survival of the fittest." This indicates that 
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options in the JAYA population are also being 
while 
simultaneously ignoring the worst solutions. To put 
it another way, the selection procedure of the 
JAYA algorithm strives to come closer to success 
by finding the global best solutions, and it wants to 
avoid failure by racing away from the worst 
answers as fast as possible. Over other population- 
based techniques, the JAYA method has various 
benefits, including being simple to build and not 
requiring any algorithm-specific parameters (i.e., 
maximum number of iterations and the population 


drawn to the best global solutions 


size). 
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Start 
7 
Initialize the parameters of the JAYA algorithm 
v 
Construct the initial population 
v 
Sort the population 
x! and x are the best and the worst solutions 
v 
i=l < 
v 
a Update the current solution x/ using Eq. (3) 
7 
No Is the new solution x 
i=i+l ~@ 3 - 
n superior than the old one x! 
y Yes 
Select the new solution 
Bd Is i equal to N? 
’ Yes 
No 
Is termination condition satisfied ? 
y Yes 
Return the best solution x! 
End 
Figure 2 JAYA Algorithm [10] 
The JAYA algorithm is comprised of six quickly and effectively. The flowchart in Fig. 2 
procedural phases. These procedural procedures depicts the processes involved in the JAYA 
may assist academics in the field of optimization algorithm's procedural steps. The pseudo-code for 
algorithms in learning how to apply this method the JAYA algorithm is also given in Table 1. 
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Table 1 Pseudocode of JAYA Algorithm [10] 


Algorithm 1 The pseudo-code of JAYA algorithm 


: Initialize the parameters of both JAYA algorithm and optimization problem (NV, T’, etc.). 


1 

2: Initialize a population of N solutions randomly. 

3: Calculate f(X;) SE ae ee ee 

4: Sort the population: (x! and x are the best and the worst solutions respectively). 
5: &1 

6: while (t < 7’) do 

he fori=1,---,N do 

8: for j = 1,--- ,Ddo 

9: Set r; €[0,1] 

10: Set r2 €[0.1] 

a & x =a25+r1 x (a} = r;\) —roxX(x 
12: end for 

13: if f(x") < f(x') then 

14: x'=x" {Update process} 

15: end if 

16: end for 

17: t=t+1 

18: end while 


The procedure of JAYA algorithm is thoroughly 
discussed in following steps: 


Step:1 JAYA method settings and optimization 
problem variables should be set to their default 
values at this point. The JAYA Algorithm's starting 
parameters are established at the run's beginning 
stage. No control parameters are required to use the 
algorithm JAYA. The population size N and the 
number of iterations T are the only two algorithmic 
parameters. Typically, an optimization issue is 
represented in this way: 


min f(x) 
S.t. 
8 (x) =¢ WF = (82.2055 n) 
h(x) <d, Wk =(1,2,...,m) 


(1) 


where f(x) is the objective function used to 
determine the fitness of the solution x = (x1, x2...., 
xD). where xi is a decision variable assigned a 
value between the minimum and maximum bounds 
such that xi € [Xmini ,Xmaxi ]. gj is the i he equality 
constraints and h, is the k inequality constraints. 
Typically, the problem variables, dimensions, and 
associated data are taken from a reference dataset. 


Step 2: Initial Population Construction for JAYA 
Algorithm:There are initial solutions (or 
populations) of JAYA algorithm that are built and 
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saved in the JAYA Memory (JM). There are N 
solutions and D dimensions in Eq. 2, hence the JM 
is an augmented matrix of the size N x D, as stated 
in Eq. 2.Conventionally, solution is randomly 
constructed: JMi,j = Xmin j + (Xmin j — Xmax j ) x 
md, Vie€ (1, 2, --- , N) AvVj € (1, 2, -: , D). The md 
function is a random generator and its value varies 
from 0 to 1. 


x x! Xp f(x') 
5 2) 
JM-= x, Xp} |fG>) 
rf iad : rel (x) 


(2) 


As a result, the JM solutions are ranked in 
ascending order based on their objective function 
values for each solution. Therefore, x! is the best 
answer, while x’ is the worst. 


Step: 3 Evaluation Process of JAYA Algorithm: 
Using the JAYA operator given in Eq. 3, the 
decision variables of all JM solutions are modified 
iteratively. 


i al ay NW tad 
x Hxtry XQ; Ixi1) 1 XQ; Ll) 


(3) 


In Eq. (3), xj' and x; denotes the new updated and 
current solution. On the the hand, rl and r2 are two 
uniform function and its values varies in the range 
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of [0-1]. In order to find the correct balance 
between exploitation and _ exploration, these 
produced random numbers are employed. x1 j is 
the best solution's decision variable j, whereas xN j 
is the worst solution's decision variable j. For the 
JAYA algorithm, the distance between the best 
solution's decision variables and the present one, 
and the gap among optimal solution's model 
parameters and current one, determines the 
diversity control. Higher exploitation occurs at a 
closer distance, whereas exploration occurs at a 
greater distance [11-12]. 


Step4: Update JM: Every iteration, the JM 
solutions will be updated. Calculate the objective 
function value of the new solution f(x'). If f(x'i) f, 
the current solution xi will be replaced by the new 
solution x'i (xi ). This procedure will be repeated N 
times. 


Step: 5 Stop rule. In order to achieve the stopping 
rule, which in certain cases is the maximum 
number of iterations T, the JAYA algorithm repeats 
Steps 3 and 4 continuously. 


4. Simulation Results 


The planned method is simulated in MATLAB. 
Further, various performance metrics are calculated 
for it, as explained below. 


4.1 Mean Square Error (MSE):This parameter is 
used to measured the error between original and 
encrypted data. It is calculated using the following 
equation. 


_ Lik (0i- ED? ( 


MSE 4) 


Table 2 shows the MSE value for different files are 
taken under consideration for the planned method. 


Table 2 MSE for the Planned Method 


Files MSE 
Filel 6701 
File2 3066.3 
File3 6764.6 
File4 5057.2 
File5 7529.6 


4.2 Peak Signal to Noise Ratio (PSNR):Peak 
Signal-to-Noise Ratio (PSNR) is utilized to 
highlight the differences between the original and 
encrypted data. The primary benefit of employing 
this measurement is determining the noise level of 
encrypted data. The PSNR values of the provided 
data are calculated using the following equation: 


2 
PSNR = 10logi)—— (5) 


In encryption scheme, low value of PSNR is 
desirable. Table 3 shows the PSNR value for 
different files are taken under consideration for the 
planned method. The result shows that the planned 
method achieves low PSNR. 


Table 3 PSNR for the Planned Method 


Files PSNR 
Filel 9.8694 
File2 13.2647 
File3 9.8284 
File4 11.0917 
Files 9.3631 


4.3 Correlation Coefficient (CC): correlation 
coefficient parameter is used to estimate the 
encrypted data quality. The correlation coefficient 
value varies from -1 to 1. The 1 value reflects the 
high correlation between original and encrypted 
data whereas 0 value reflects the low correlation 


coefficient between data. In the encryption scheme, 
low value of correlation coefficient is desirable. 
Table 4 shows the correlation coefficient value for 
different files are taken under consideration for the 
planned method. The result shows that the planned 
method achieve correlation coefficient near 0 value. 


Table 4 Correlation Coefficient for the Planned Method 
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Files CC 
Filel -0.0675 
File2 0.1716 
File3 -0.0064 
File4 0.0159 
File5 -0.3565 


4.4 Convergence Graph 


The convergence graph is plotted between fitness 
function vs. iteration. In the planned method, 
random key is generated using JAYA algorithm. 


The convergence graph for the key generation is 
shown in Figure 3. The result shows that the 
planned method gives approximate | entropy in the 
23" iteration. 


Convergence Graph for the Key Generation 


10 20 


Figure 3 Convergence Graph 


4.5 Key Generation Tests: In the literature, a 
number of key generation test is available. In this 
work, I have calculated the frequency test. The 
frequency test calculates the 0’s and 1’s probability 
in the generated key. In the ideal case, 50% 0’s and 


1’s required in the key. Table 5 shows the 
generated keys. The result shows that the planned 
method generates approximate equal number of 0’s 
and 1’s. 


Table 5 Generated Keys 


Generated Keys 0’s and 1’s Probability 
[162 119 0 94 183 176 68 88 227 63 238 66 178 49 177 | 0’s=255 
63 136133 157 218 66 57 82 195 162 187 77 2 232 179 32 | 1’s=257 
231 96 1545 132 30 237 106 216 186 164 187 194 247 229 
252 57 199 104 197221 250 55 192 154 140 170 208 183 129 
109 167 94] 
(223 192 13 216 46 137 233 119 140 79 168 254 74 19 3 | 0’s=256 
95 100231 177 193 249 247 18 232 15 5 31 137 200 182 30) 1’s=256 
10 103 42233 213 247 199 33 125 214 112 28 148 210 70 
233 95 39 92 102136 179 64 32 171 104 240 147 47 179 
72 113 163] 
4.6 Comparative Analysis 
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In this section, the planned method is compared 
with the existing method [9] based on the 
correlation and execution time parameter in Table 6 
and 7. 


The result shows that the planned method provides 
lesser correlation coefficient when compared to 


data perturbation method whereas approximate 
same correlation coefficient as compared to ANU 
algorithm. On other hand, Table 7 shows that the 
planned method takes lesser execution time for data 
encryption over the existing method. 


Table 6 Comparative Analysis based on Correlation Parameter 


Files Singh et al. [9] Planned Method 
ANU Algorithm Data Perturbation 

Filel -0.0687 -0.6074 -0.0675 

File2 -0.1574 -0.6090 0.1716 

File3 -0.0662 -0.6079 -0.0064 

File4 -0.1505 0.6108 0.0159 

FileS -0.1276 -0.6089 -0.3565 


Table 7 Comparative Analysis based on Execution Time 


Singh et al. [9] 


Planned Method 


Execution Time (in seconds) 0.42 


5. Conclusion and Future Work 


In this paper, a privacy-preserving method is 
designed for smart grid. To achieve this goal, the 
smart meter dataset is split into sensitive and non- 
sensitive part. The sensitive part is encrypted by 
performing XOR operation with a random key. The 
random key is generated using the JAYA 
algorithm. Based on the objective function a 
completely random key is generated by using 
JAVA algorithm. The performance analysis is done 
using various parameters such as MSE, PSNR, 
correlation, and key generation tests. The result 
shows that the planned method provides lower 
PSNR and correlation coefficient between original 
and encrypted data. Besides that, the planned 
method provides superior results over the existing 
methods. In the future, the dataset is post-process 
for load forecasting and bill generation purposes. 
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